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(57) ABSTRACT 

An on-chip i/o -processor for controlling and communication 
with peripheral devices, wherein an i/o processor core (12), 
comprising at least one pin controller (29) for reading and 
setting physical i/o-pins (6), starting timers and generating 
interrupts for the i/o processor core (12), at least one timer 
(26) for sampling i/o-pins (6), setting i/o-pins (6) and 
generating interrupts for the i/o-processor core (12) at well 
defined points of time, said i/o-processor core (12) providing 
instructions for controlling said at least one pin controller 
(29), said at least one timer (26) and i/o-pins (6), an on chip 
RAM (3) holding instructions for the i/o processor core (12), 
at least one register (4) for exchanging information between 
the i/o-processor (2) and a connected CPU (1) on the same 
chip and inversely, configurable logic (5), connected 
between said core (12) and said i/o-pins (6), for synchroni- 
zation of incoming and/or outgoing signals. 

9 Claims, 10 Drawing Sheets 




07/31/2003, 



EAST Version: 



1 . 04 . 0000 



U.S. Patent Feb. 13, 2001 Sheet 1 of 10 

8 ~-^J Ext RAM 1 
I 



CPU 



DMA 



11 



FIFOs 
















US 6,189,052 Bl 



Off Chip 



On Chip 



Existing hardware 



fed 



configurable logic 



l/O-pins 



13 




15 



RS232 
Peripheral unit(s) 



1284 
Peripheral unit(s) 

FIG. 1 



ATA 
Peripheral unit(s) 



07/31/2003, EAST Version: 1.04.0000 



U.S. Patent Feb. 13, 2001 Sheet 2 of 10 US 6,189,052 Bl 



CO 



75 w 
(O CO 

II 




07/31/2003, EAST Version: 1.04.0000 



U.S. Patent Feb. 13, 2001 Sheet 3 of 10 US 6,189,052 Bl 



17 



19 



PC- 
Handler 


Register 
Bank 


ALU 


Command 
Register 



18 



/ FIFO, Buses j 



CPU 



20 



25 



8-bit compare value 



21 



8-bit mask register 



8-bit pin vector 
from 8 timers 



22 



FIG. 3 



p 



& 

T 

23 



Generat event 



24 



FIG. 4 



07/31/2003, EAST Version: 1.04.0000 



U.S. Patent Feb. 13, 2001 Sheet 4 of 10 US 6,189,052 Bl 



Timer register 



count 


value<14:0> 


-—27 






< > 

16 bit 

Timer extended register 


Event/IRQ 


mode<6:5> 


pin<4:0> 


• 28 



<_ > 

8 bit 



FIG. 5 



32 pins 



pin 
select 



+ 



Load new value 



Current 
timer value 



-1 



=0 



Every cycle 



Read pin 




Last pin 
value 



Set pin 
high / low 



Generate 
Event /IRQ 



FIG. 6 



07/31/2003, EAST Version: 1.04.0000 



U.S. Patent Feb. 13, 2001 Sheet 5 of 10 US 6,189,052 Bl 



30 



31 



Generate 
event 



■33 



32 pins 



pin 




Wait for pin to 


select 




go high or low 



Generate 
interrupt 



-32 



Start timer 



-34 



FIG. 7 



35 



Controller register 



on/off 


mode<1 1 :9> 


H/L 


timer<7:5> 


pin<4:0> 



13 bit 



FIG. 8 



07/31/2003, EAST Version: 1.04.0000 



U.S. Patent Feb. 13, 2001 Sheet 6 of 10 US 6,189,052 Bl 

11 



FIFOs 



FIFOs 


19 

i 


i 




I ► 


Register Bank, R0-R9 






t 




Physical Buses 




p» 



36, 37 

FIG. 9 



Start Data Data Data Data Data Data Data Data Stop 
bit bitO bit 1 bit 2 bit 3 bit 4 bit 5 bit 6 bit 7 bit 



MXMxIXlxIxlxr 
TTTTTTTTt 



38 



i i 
i i 
I I 

v 

39 



Sample points 



i i 
i i 
i i 



39 40 



FIG. 10 



07/31/2003, EAST Version: 1.04.0000 



U.S. Patent Feb. 13, 2001 Sheet 7 of 10 



US 6,189,052 Bl 



#0 nSelectln H 

#3 PError P 

Data<8..1> 

#1 nAutoFd H 

#2 nStrobe H 
#4 nAck P 



#5 Busy 



#6 n Fault 



#7 Select 



P 
P 



Point 0 1 



Mode 







i i 
i i 
i i 












• ! / 

Mi 

i i 

i i 
i i 














j — 














i i 
i i 
i i 
i i 




\ X 


jOOQO 0000 














i i 




! I 
1 












it— 








i 

t i 
i 










i 
i 
i 
i 
i 


\ 


— 


/ 
















r 
I 
I 










^^^^^ 






















iii i 










T P 










Tp l 





2 3 



4 5 6 









Compatability 


Negotiation 


Nibble 



Phase 



Mode 



FIG. 11 



07/31/2003, 



EAST Version: 



1.04.0000 



U.S. Patent Feb. 13, 2001 Sheet 8 of 10 US 6,189,052 Bl 




07/31/2003, EAST Version: 1.04.0000 



U.S. Patent 



Feb. 13, 2001 



Sheet 9 of 10 



US 6,189,052 Bl 



44 

a 



lnstr<23:0> 



45 

19 I 

\ b 



47 

I 

a 



48 



Instruction 
Decoder 



49 



Si 

CD. 

si 

0)1 

O. 



Instruction 
Decoder 



src 1 



ALU result 



46 

i 

I 
I 
I 

zerojflag 




ALU 
operation 



to PC handler 



FIG. 13 



50 




from event- and PC-select 



from branch- and IRQ-select 



FIG. 14 



07/31/2003, EAST Version: 1.04.0000 



U.S. Patent Feb. 13, 2001 Sheet 10 of 10 US 6,189,052 Bl 




07/31/2003, EAST Version: 1.04.0000 



HELD OF THE INVENTION 



US 6,189,052 Bl 
1 2 

ON-CHIP I/O PROCESSOR SUPPORTING accomplished by providing a multi protocol supporting 

DIFFERENT PROTOCOLS HAVING ON-CHIP i/o-processor according to claim 1. 

CONTROLLER FOR READING AND It is an object of the present invention to provide an 

SETTING PINS, STARTING TIMERS, AND i/o-processor, which replaces a number of dedicated i/o 

GENERATING INTERRUPTS AT WELL 5 controllers. 

DEFINED POINTS OF TIME Further, it is an object of the invention to provide an 

PRIORITY CLAIMED i/o-processor compatible with different CPUs. 

This application claims the benefit of priority to Swedish ., ob ) cci u °[ Reinvention is to provide a flexible 

Application No. 9704627-0, filed Dec. 11, 1997, entitled I/O 10 ^cessor with the ability to change i/o protocol without 

Processor, which is incorporated by reference as if fully set an ^ ar ware re " QSl ^ a ' 

forth herein. Still another object of the invention is to provide an 

i/o-processor providing shared pin for different protocols. 
Another further object of the invention is to provide an 

The present invention relates to a design and implemen- 15 i/o-processor enabling a very short design time for a new 

tation of an on-chip i/o-processor and more particularly an protocol. 

i/o-processor supporting different protocols. It is also an object of the invention to provide a simple 

DESCRIPTION OF THE PRIOR ART i/o-processor running at a high clock frequency. 

The main function of a computer involves its CPU and 2Q Further > it is an object of the present invention to provide 

main memory. The CPU fetches instruction and data from a ver y flcxible interface from the i/o-processor to the pins on 

the main memory, processes them, and stores the results in ^ ct "P> 1X - thc P ms should not have a ^ cd dedication to 

the main memory. Other important functions of the com- a s P ec *fi c protocol. 

puter are the i/o operations (input/output operations). These Thus, an advantage of the processor based imptementa- 

operations are performed by the i/o system, comprising i/o 25 ^on according to the invention is that the processor design 

devices or peripherals, control units for the devices, and is much more re-usable and allows for short design time for 

software to carry out the i/o operations. The purpose of the a new protocol, 
i/o system is to transfer information between the main 

memory or the CPU and the i/o devices. BRIEF DESCRIPTION OF THE DRAWINGS 

There are different kinds of prior art i/o systems available, 30 In order to explain the invention in more detail and the 
all of which systems utilize the CPU to some extent. If i/o advantages and features of the invention a preferred embodi- 
operations are completely controlled by the CPU the system ment will be described in detail below, reference being made 
is usually called a programmed i/o system. Programmed i/o to the accompanying drawings, in which 
can be implemented with very little dedicated i/o hardware. FIG. 1 is a schematical block diagram of an example of 
However, a programmed 1/0 system often suffers from 35 a system environment comprising an i/o-processor accord- 
spending a great deal of time performing trivial 1/0 opera- • t & t invendoa; 
tions. r 

r, tU . > * j -t ■ » r a /j ■ FIG. 2 is a schematical block diagram of an embodiment 

Other computer systems are provided with DMA (direct ft , ., . ri „ t 6 

\ i^t^a - j j "4 * 11. j of the i/o-processor in FIG. 1: 

memory access). DMA requires some additional hardware, r 

such as a DMA controller. In these systems the CPU is still 40 FIG> 3 fe a schematical block diagram of an i/o processor 

responsible for initiating block transfers, while the DMA 35 001(5 adding to the invention; 

system carries out the transfers without CPU interventions. FIG. 4 is a schematical view of an arrangement for pin 

In some prior art systems the i/o device or its controller is mask combination used for some protocols; 

provided with additional circuits for performing service FIG. 5 is a schematical view of timer registers; 

request, or interrupt, from the CPU. The interrupt capability 45 piG. 6 is a functional view of an i/o-processor timer; 

frees the CPU from polling i/o device status. Contrary to a FIG. 7 shows a principle of a controller of the invention; 

DMA request, an interrupt causes the CPU to switch pro- _„ _ . „ 

grams by saviog the state of its current program and "jump- FIG " 8 shows a 

ing" to an interrupt handling program. When the interrupt is FIG. 9 shows data transfer paths of the i/o-processor; 

completed the CPU can resume execution of the interrupted so FIG. 10 shows an RS-232 RxD signal; 

program. FIG. 11 is a timing diagram for the IEEE-1284 Negotia- 

A further step towards complete control of i/o operations tion Phase; 

is obtained by using an i/o-processor. like a DMA FIG> 12 & an embodiment of the i/o-processor pipeline 

controller, an i/o-processor has direct access to main steps (IF and EX); 
memory and can interrupt the CPU. However, it can also 55 

j • > 1 t*i_ 11 j ■/ FIG. 13 shows the different phases m the EX-step; 

execute programs directly. These so called i/o-programs can x „ „ . „ 

use an instruction set different from that of the CPU. , FIG ' 14 shows v ! he ste P s for of a P ro P er pc 

In prior art i/o systems the CPU and its memory are counlcr ) m the ™ han ^ er f . . 

connected to channels or i/o-processors. Each i/o-processor FIG ' 15 15 a schematical block diagram of pin instate and 

can accommodate a number of peripheral device controllers, 60 vanic registers. 

each controller controlling one or more identical, or similar, DETAILED DESCRIPTION OF THE 

devices such as line printers, laser printers, disks, CD-ROM INVENTION 
disks etc. 

An example of a system overview comprising an i/o- 

SUMMARY OF THE INVENTION 65 processor according to the present invention is shown in 

The present invention is directed to an improved input- FIG. 1. A CPU 1 is assisted by an i/o-processor 2 according 

output control system, supporting different protocols.. This is to the invention and together they form a flexible chip with 
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the ability to change i/o-protocol without any hardware interface follows the IEEE-1284 standard which supports 
re-design. The i/o-processor program code must be stored five different communication modes. The compatibility 
internal (on-chip) in for example a RAM 3, to maintain mode provides host-to-peripheral communication in a man- 
speed. A register 4 is used for sharing information between ner compatible with the commonly used unidirectional Cen- 
the CPU 1 and the i/o-processor 2. A configurable logic 5 5 tronics interface. The nibble and byte modes provide 
takes care of synchronization of incoming signals and makes peripheral-to-host communication and may be combined 
it possible to share i/o-pins 6 between different protocols. with the compatibility mode to form a bidirectional opera - 

The system in FIG. 1 is also provided with a DMA (direct ^on. In nibble mode data is sent as two sequential nibbles on 

memory access) controller 7, for carrying out block transfer &° status The ECP md EPP modes provide symmetric 

to/from an external RAM 8 without CPU interventions. 10 and asymmetric bidirectional communication, respectively, 

In this embodiment the CPU 1 is connected to the external bctwccn host and Peripheral. 

RAM 8 via a cache memory 9 and a bus 10. A set of FIFOs 1116 1284 electrical interface consists of a 25-pin D-shell 

11 forms an interface between the i/o-processor 2, or its i/o connector, IEEE 1284-A connector. The PC compatible 

processor core 12 and the CPU 1 connector is known as the classic 36-pin D-shell Centronics 

™ , .15 interface, e.g. IEEE-1284-B connector. The IEEE-1284-C 

The performance and size of silicon is increasing faster ' • i_ - L j « • . ^ 

than the development of different i/o-protocols (e.g. RS-232, » 'j"^ ^P* ™«f «™ « 

CPCT . t x mi • * *u * u u different interface status/data signals, not shown. Some of 

SCSI, Ethernet, etc). This suggests that it now would be . , , . ' , , . , 

■u 4 i. j w j- > i these signals are mentioned by its standard name in the 

possible to use processor based solutions even for protocols f n • h • t' 

that traditionally are built as hard custom modules. & P • 

~ , 4 - ., - j- * *l The IEEE-1284 protocol allows bidirectional data to be 

One embodiment of an i/o-processor 2 according to the . c , , . r , . , . , . „ , . r 

. . _ r 4 , iri r r transferred between a host and penpheral unit. Each transfer 

invention provides support for RS-232 protocol, IEEE-1284 , c , . * , . . . c , 

1 ^ a j at* a 1 t * mode consists of several states. When data is transferred 

protocol 14, and ATA protocol 15. c , . . . , . . . , . 4 , . 

r r from host to peripheral unit the interface is said to be in 

RS-232 is the standard for the electrical serial interface Forward Phase. When data is transferred the other direction 

between Data Terminal Equipment (DTE) and Data Connect 25 me interface jg m Reverse Phase. When the mode is to be 

Equipment (DCE). The DCE is typically a modem and the changed the interface is in its negotiation phase. 

D ^ J a r, P r C rn RS "f 2 ^ ^ ^ ^ ^'T^ C* 25 The interface may be in both Forward Idle and Reverse 

and HDLQ and asynchronous serial communications. Me phases waitmg fof ^ iQ bc ^ 

The RS-232 electacal interface consists of an 25-pin host drives the eight data lines and then pukes nStrobe low. 

"RS-23211 connector. All signals are not needed and there- 30 ^ pcripheral reS p 0 nds by setting busy high. When ready 

fore the PC/AT uses a 9-pin serial port instead of the 25-pin for ^ ncxt bytCj ^ peripheral pulses nAck low and then 

as the RS-232 interface. scts j ow ^ transfer rate of 10 Mb/s corresponds to a 

The RS-232 interface can work in synchronous mode if data transfer cycle time of 100 ns. At 100 MHz clocking 

additional signals are used. An SCT, Serial Clock Transmit, frequency this means 10 cycles for each data transfer, 

signal indicates the center of each bit transmitted by the 35 ATA, Advanced Technology Attachment, is the official 

terminal. The DCE samples data on each negative flank of name f or the IDE, Integrated Disk Electronics, interface, 

this signal. An SCR, Serial Clock Received, signal is the ATA was initially used as the hard disk controller for the 

corresponding timing signal from the DCE. ibm AT computer, but soon other manufacturers followed 

The most usual way to transfer data in the RS-232 ^ IBM and started to use this standard in their own products, 

protocol is in the asynchronous mode. In this mode each The first ATA standard was defined in 1994 and is known as 

byte is preceded by a start bit and followed by a stop bit. If the ANSI X3.T9.2 standard. This defined a 16-bit interface 

parity is used the last data bit is used as parity bit. Parity may standard to be used with hard disk drives. The standard 

be chosen to be even or odd. supports two hard disks on each cable. The standard has two 

The data speed between modem and terminal must be set 45 different data transfer modes, via programmed i/o (PIO) or 
to a specific value, for example 9600 bps. This would give dire ct memory access (DMA). Later the ATA-2, XTA-3 and 
a data transfer rate of 960 byte/s if no parity is used since ATA-4 standards were defined. The official ATA standard is 
each byte consists of ten bits. If parity is used the transfer followed by several popular names like Fast ATA, Ultra- 
rate decreases to 840 bytes/s since each 10-bit sequence now ATA, ATAPI, IDE and EIDE. 

only contains 7 data bits. When the receiver detects a start 50 The EIDE interface is not a formal ATA standard but 
bit, it starts an internal counter and depending on the defines a set of extensions to the original AEA standard, 
selected transfer speed it tries to sample the data bits in the Maximum data transfer rate of EIDE is 13.3 Mb/s. The 
center of each bit. The sampling points must not be exactly newest ATA standard is the ATA-4 standard which includes 
in the center of each bit, but the total drift at the last bit the presently widely used Ultra-DMA data transfer protocol 
should not exceed 10% of one bit length. At a transfer speed 55 and the ATAPI (ATA Packet Interface) CD-ROM interface, 
of 9600 bps each bit length is about 104 }ts. Hence the The ATA-3 electrical interface consists of a 40-pin con- 
sampling of the stop bit should occur at 884±10 /a& after the nector plus an additional 4-pin DC power connector. The 
start bit was sent. Higher transfer rates requires higher standard defines 31 interface status/data signals, 
precision and at 1 Mbps the precision of the last sample An ATA DMA transfer is always preceded by a sequence 
should be ±100 ns. At a clocking frequency of 100 Mhz 1 60 0 f PIO writes in the hard disk drive's 8-bit special registers. 
Mbps would correspond to 10 cycles of total drift of the This is done in order to set up the following DMA transfer, 
sampling system and a bit length of 100 cycles. Correspond- When the drive is ready for a DMA transfer it asserts the 
ing values are valid for the transmitter. The last bit sent must DMARQ signal. 

not drift more than 10% of the length of one bit. Th e host answers by asserting DMACK and DIOR for a 

The IEEE-1284 protocol is an asynchronous bidirectional 65 DMA read or DIOW for a DMA write. Data is latched on 

parallel communication protocol often used for connections each falling edge of DIOR/DIOW and should be valid in a 

between a terminal and printers or other peripherals. The well defined time interval. A DMA transfer is ended by the 
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peripheral by negating DMARQ before the host negates FIG. 2 shows a schematical block diagram of the archi- 

DIOR/DIOW during the last transfer The ATA cycle time is tecture of an embodiment of the i/o-processor and its i/o 

define as the time between two falling edges of DIOR/ processor core 12 in FIG. 1. 

DIOW. ATA-3 mode 2 supports a cycle time of 120 ns. The j^q ^re 12 is provided with four FIFO Controllers 16 

Ultra- ATA protocol differs from the standard ATA DMA 5 use d to handle the FIFOs 11. The FIFO Controllers 16 

transfer in such a way that the drive is allowed to control a update FIFO Status registers. They also read and write data 

transfer. This is done by adding four control signals to the blocks, where one block can be 8, 16, 24 or 32 bits. To make 

interface: HDMARDY Host DMA Ready; HSTROBE Host j t possible to check if there are more than a certain number 

Strobe; DDMARDY Device DMA Ready; DSTROBE of blocks to read or enough room for a certain number to 

Device Strobe. 10 write, it also contains two registers. One of these registers is 

In traditional ATA the host controls the data strobe sig- used to store a value to which the number of free blocks is 
naling (DIOR/DIOW). In Ultra-ATA the hard disk drive compared. This is continuously done and the result is 
controls the data strobe (DSTROBE) during a DMA read displayed in the FIFO status registers gt__free bit (value 
burst while the host controls the data strobe (HSTROBE) greater than free). The other register is used to store a value 
during a DMA write burst. This allows higher transfer 15 to which the numbers of available blocks to read is corn- 
speeds. Data is latched on both rising and falling edge of the pared. The result from this comparison is displayed in an 
strobe signal which doubles the data transfer speed com- It-avail bit (value less than avail). The instructions to these 
pared to traditional ATA DMA transfers. functions are shown in Table 1. 

The minimum cycle time in Ultra-ATA is specified to 55 

ns (mode 2). In a 100 MHz system this would correspond to 20 TABLE 1 

5.5 clock cycles. ■ r 

... j. . , t j . Instruction Description 

An i/o processor according to tne invention should be 



re-programmable to any general i/o protocol. The protocols LFF (Load value to FIFO Load the value that is compared to 

described herein are only examples. The design goal for Free > e« ater than ) (greater than) the number of free block in 

• • o o «c the FIFO 

RS-232 is four independent 1 Mbit/s channels. This would Tm , . „ . . A . h 

. , . . LFA (Load value to FIFO Load the value that is compared to (less 

require eight independent processes since each channel Avail, less than) than) the number of available block in 

consists of one transmitter and one receiver. The timing of the FIFO 

the signals has to be quite exact at higher transfer speeds. ^ 

There have to be ways to create pulses of a specific length ^ . ^ , „ x „ rt 

and ways to sample a sigoal at a specific time Further there 30 ^^"f 0 ' core 12 *. a \ 6 ' blt MSC <™?™*°">- 

has to be some kind of UART (Universal Asynchronous J* 16 " 18 ^ aS ' W e] ?™ d processor with 2 steps, 

Receiver Transmitter) circuit to convert serial to parallel descr * ed later - !" l ? ber ° f J plpe J 1 T StepS ? 3 C ° m f "i" 

data and vice versa mise Detween tne clock speed and the number of clock 

jutit] i io/4 * i o l * l. cycles for one^instruction to be finished. 

The IEEE-1284 protocol uses an 8-bit bus, wherein a bus „ „ . t „ , . , 

architecture is used to reach a transfer speed of 10 Mb/s. The 35 3 B * schematical block diagram lof ^the ^o-processor 

communication is bi-directional and therefore the data bus ™ re acc ^dmg to the invention. A PC-Handler 17 (PC- 

should also be bi-directional. The existence of different Pro S ram Cou nter) is used to manage all the interrupts and 

modes and multiple mode changes requires some sort of events - 11 ^ takes care of me branch illstrilctions - All 

commands between the main system (CPU) and the i/o An for the interrupt and event routines are stored in 

processor 40 the PC-Handler 17. These addresses are set by the instruc- 

™, A rr. A , , , A . . . tions LIR (Load Interrupt Register) and the LER (Load 

Ine A1A protocol has a cycle time around 60 ns and a c tD • * \ ah jj • *u v ^ 

- ; . , , , r i • event Register). All addresses m tne i/o-processor core 12 

high clocking frequency of the i/o system would facilitate ... A *w n m u- • * »• 

. , * 4 - r A-ri are 11 bits wide; this allows 2k on chip instruction memory, 

the implementation of ATA support. m,- • « ■ * * ■ 1 * *i j -u j / 

. t rr , . Irns is sumcient to implement at least the described proto- 

The use of a 16-bit bidirectional data line requires a 16-bit 45 ^3 ^4 ^ 

^ V S ' 7* ^* P^ 1 ™ of the wh ° le system, To enaW ; OTnditi0ffld bfanches the Vo . processor core U 

additional ^functions .like auto address mcrement to speed up ^ ^ ^ qt ^ q ^ ^ 

PIO transfers would be nice. In other protocols, such as m avton A a j fl„T /v n n ~\ tuJv •„ ^ k„ ?i at it id 

»l c?ncT 1 j c r *. . . -4 extended nag (X-flag). The Z-nag is set by an ALU 18 

the SCSI protocol, some data transfers are very time critical. /A t . & ^ T . 6/ TT . A . rtL ... . ™_ 

wn. j f • * * j • * * ^ iL (Arithmetic and Logic Unit) if the result is equal to zero. The 

Wnen data is transierred in synchronous mode rrom the 50 V a • ,l a iiir l-a- . * • i_ . • 1 * u 

• Wo1 ^ . . y v . , X-flag is set by the ALU shift instruction but is also set when 

peripheral unit, a strobe is signaling that data is valid and . 1 •/ • j t. L « • j 

u ul * . 1 1 rpn j ^^s^ 1 *f sensing a general 1/0 pin and when a timer buffer is read, 

should be latched The data is only valid a short time and the ^ } ^ u fc tQ be noticed 

1/0 processor must therefore immediately enter the right state ^ ^ iQ {he . ^ ^ branch have Qne 

111 ^cT n ° u ^ r ^ P ^ CCSSCS u C u m delay slot. This delay slot (the instruction after the branch 

parallel. SCSI is built around different phases and the 55 mstmction) ^ ^ys performed and should (if possible) be 

interface is often listening to a combination of two or more used ' r v 7 
signals. This is solved by some kind of bus architecture but 

it would be even better to allow the design to check or wait TABLE 2 

for different combinations of signals. 

In the following description, numerous Specific details, 60 Branch instruction Description 

such as address sizes, bus width, number of steps in the ' ' v ~ ™ 

^ ,1 j _ ■ * , . , . . j 4 . BRA (BRanch Always) Branch always, not conditional 

method, etc, are provided in detail in order to give more BEQ [ Branch tf E ^ Branch ilZr y fi&gBl 

thorough description of the present invention. It will be BNE (Branch if Not Equal) Branch if z-flag - 0 

obvious for those skilled in the art that the present invention BXE (Branch if X-flag Equal) Branch if X-flag - 0 

may be practiced without these specific details. Some well- 65 BXN (Branch if x-flag Not Equal) Branch if x-flag - 1 

known features are not described in detail so as not to make 
the present invention unclear. 



07/31/2003, EAST Version: 1-04.0000 



US 6,189,052 Bl 



The PC-Handler 17 also has three more instructions: 

1. The RTI (ReTurn from Interrupt) instruction which is 
used at the end of an interrupt routine. This restores the PC 
to the return address; it also restores the Z and X-flag. 

2. The CRA (Change Return Address) which allows the 
user to change interrupt return address, which is useful when 
an interrupt indicates that something wrong has occurred, 

3. The WTE (WaiT for Event) instruction which "pauses" 
the processor until an event or interrupt occurs. Notice that 
it is not recommended to use a WTE instruction inside an 
interrupt routine. 

A register bank 19 in the i/o-processor core contains 10 
general 16-bit wide registers. Since many protocols only use 
8 bits (1 byte) the registers can be split so that one 16 bit 
register could be used as 2 8 bit registers. This is done in the 
ALU 18 with the SWP (SWaP) instruction, which swaps the 
upper 8 and lower 8 bits. All ALU instructions can then be 
performed either at byte (lower 8 bits) or at word (all 16 
bits). The register bank 19 also handles the data into the 
ALU 18. Data into the ALU 18 could either be from the 
register bank 19 or from one of the following sources: 

Instruction code, immediate data, LRI (Load Register 
with Immediate); 

FIFO Data, F2R (FIFO to Register) and R2F (Register to 
FIFO); 

FTFO_Status, Each FIFO_Controller has a 16 bit wide 
status register that can be both written (SFS, Set FIFO 
Status) and read (CFS, Check FIFO Status); 

Bus data, B2R (Bus to Register) and R2B (Register to 
Bus); 

Command register 20, the register towards the CPU. Read 
with CCR (Check Command Register) and write with 
SCR (Set Command Register); and 

Pin register, a register with 14 pins, used check/sense 
several pins at the same time. 

Hie Command register 20, 16 bit, is used to exchange 
commands between the CPU 1 and the i/o processor 2 and 
inversely. This register is in fact two registers, one in which 
the CPU writes and one in which the i/o processor 2 can 
write. When the CPU Command Register is changed (e.g. 
when the CPU gives a command) an interrupt is generated 
to the i/o processor 2. An example of a command is the baud 
rate when a RS-232 channel is set up. 

The ALU 18 (Arithmetic and Logic Unit) has the most 
common ALU instructions. As described above, about the 
register bank 19, the ALU 18 can operate both on byte (8 
bits) and on word (16 bits). The Z-flag is set if the result is 
equal to zero. All ALU operations can be found in Table 3 



TABLE 4 



TABLE 3 


Instruction 


Description 


INC (INCrease) 


Increases with 1 


DEC (DECrease) 


Decreases with 1 


CMF (CoMPare) 


Sets Z-flag if equal 


AND 


Ordinary AND 


OR 


Ordinary OR 


NOT 


Ordinary NOT 


ROR (ROtate Right) 


Shifts in the X-flag from left The 




outshificd bit is put in the X-flag 


SWP (SwaP) 


Swaps the upper 8 bits with the lower 8 




bits 


MOV (MOVe) 


Copies one register 



The i/o-processor architecture is built around 16 interrupts 
and 4 events. There are several ways of generating interrupts 
and events, see Table 4. 



Source 


Interrupt 


Event 


Timer 


Yes, every lap 


Yes, every lap 


Controller 


Yes 


Yes 


Instr. code 


Yes 


No 


Several pins = bit pattern 


No 


Yes 



10 All interrupts and events have their own programmable 
instruction addresses, e.g. subroutine addresses. Because 
there are no queuing of interrupts or events, the signaling 
unit must keep signaling until it receives an acknowledg- 
ment. Queuing of signals should not be essential since the 

15 routines performed often are very short (4—5 instructions). 
Instead, priority is used to choose among interrupts or 
events. Interrupt 0 has the highest priority 15 has the lowest. 
Events work at a lower priority than all interrupts. 
When an interrupt occurs the flags and return addresses 

20 are stored and the executions continue at the corresponding 
interrupt address. Inside the interrupt routine, it is possible 
to change the return address if wanted; this is useful if an 
error occurs and it is necessary to abort and start over again. 
If an event occurs the execution continues at the corre- 

25 sponding event address. The events has no return address 
and do not store the flags. Events are not activated until the 
processor is in wait mode, using the WTE (Wait Event) 
instruction. 

The events are supposed to be used together with inter- 
30 rupts with very high priority which must not be locked out 
by non critical events. Less important signaling should be 
handled by the event handler which wili allow important 
interrupts to be handled immediately. A good example is 
protocols where the peripheral unit drives a request signal 
35 high to indicate that data is valid and may be latched in. If 
this is a very fast protocol (i.e. time critical) the i/o-processor 
2 must immediately enter the proper interrupt and read the 
data bus. 

The interrupt routine addresses are stored in 16 11 -bits 
40 interrupt registers. These are loaded by the LIR (Load 
Interrupt Register) instruction. The interrupt source and 
priority order are shown in Table 5. Interrupt 13 is a special 
interrupt, and it is activated by the instruction code INT. The 
use of this interrupt can for example be if you want to take 
45 advantage of the fact that the i/o processor does not allow 
interrupts in interrupts. Thus, if a very time critical operation 
is to be performed the INT instruction can be used to go into 
an interrupt routine and do the work there. 



50 



TABLE 5 



Prio rity, 0 - highest 



65 



0 


Timer 0 


1 


Controller 0 


2 


Timer 1 


3 


Controller 0 


4 


Timer 2 


5 


Controller 2 


6 


Timer 3 


7 


Controller 3 


8 


Timer 4 


9 


Controller 4 


10 


Timer 5 


11 


Controller 5 


12 


Timer 6 


13 


INT (Instruction Code) 


14 


Timer 7 


15 


CPU 
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TABLE 7-continued 


Mode 


Behavior 


6 


Generate Event + Generate IRQ 


7 


Generate Event + Start timer + Generate 




IRQ 
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Hie four event addresses are stored in four 11-bits event 
registers. These are loaded by the LER (Load Event 
Register) instruction. The four events may be selected from 
6 controllers, 7 timers or two pin masks by the LES (Load 
Event Select) instruction. As with the interrupts it is event-0 
that has the highest priority As shown in FIG. 4, the 
i/o-processor 2 has two 8-bit pin mask registers 21, and these 
registers are used together with the timer's pin select reg- 
isters 22. The mask registers 21 are used to select a com- Configurable logic takes care of the synchronization of 
bination of pins, by means of an AND-gate 23, which are 10 incoming signals and makes it possible to share pins 
compared, by means of a comparison component 24, with between different protocols. The logic works as an interface 
two 8-bit pin mask compare registers 25. This solution between the RISC core and the physical pins. But how 
makes it possible to wait for a combination of several signals should this configurable logic be designed? A first approach 
and then generate an event. This feature is not needed in the would be to make the connections as flexible as possible, 
three basic protocols 13, 14, and 15, but in other protocols 15 This means that each pin can be defined as a general i/o pin. 
(i.e. SCSI) it is very useful. To increase speed at bus operations such as a write operation 

The i/o-processor architecture contains 8 special 15-bit to the ATA data pins, some sort of bus should be defined. An 
timers 26, shown in FIG. 2. The value of the timer 26 is ideal solution would be to have the ability to configure any 
stored in a 16 bit register 27 (Timer register), shown in FIG. pi° to belong to a bus in no special order. This would 
5, wherein the 16:th bit is used to indicate whether the timer 20 unfortunately demand unrealistically large registers and 
26 is active or inactive, A timer counts continuously, until it multiplexors to build. Defining four 16-bits i/o buses on a 
is stopped, and generates an interrupt or event every round, cm P with 128 i/o pins would take up to 1500 memory 
It is allowed to change a timer's register value during a elements (e.g. flip flops) plus large multiplexors, 
count; in the next round the new value will be loaded into the A more realistic solution could be to have a number of 
timer. A timer can be started by a controller or from the 25 pre-defined buses that makes r/w from several pins in 
instruction code. parallel possible. These would be connected to pre-defined 

A second register, an 8 bit wide Timer extended register pins. Beside the buses there are a number of general i/o pins 
28, is used for the timers special functions. With this register read y to be used for handshaking and/or single data lines, 
it is possible to perform different features at every counted The logic may release a whole bus or a single i/o pin which 
lap. The msb in this register is used to select if an event 3 ° will result in a tristate mode and make inputs possible. 
(msb=l) or an interrupt (msb^O) will be generated. There are The i/o-processor 2 has 32 general i/o pins, two 8-bit 
4 modes for each timer, see Table 6; the different modes are buses 36 and two 16-bit buses 37. All pins and buses can be 
selected in the timer extended register 28. A functional view tristated (to enable input) with the instructions TRP 
of a timer is illustrated in FIG. 6. (TRistate Pin) and TRB (TRistate Bus). To configure a pin 

35 or bus for output the instructions NTP (Not Tristate Pin) and 
TABLE 6 NTB (Not Tristate Bus) are used. 

To enable fast data transfer between the buses 36, 37 and 
the FIFOs 11 the i/o-processor architecture allows direct 
transfers, as shown in FIG. 9. The instructions used are B2F 
40 (Bus to FIFO) and F2B (FIFO to Bus). It is also possible to 
modify data by leading the data through the Register Bank 
19, as described above. 

Some protocol problems are described and a solution of 
the problems is presented. The program that describes the 
45 protocol is written with assembler instructions. The i/o- 
An example of both the timers and controllers is described processor assembler then converts this ordinary text file to a 
^ ater t binary file. The i/o-processor assembler is able to handle 

The i/o-processor architecture also contains 6 specialized t o make it easier to write programs. In the following 

pin controllers 29, shown in FIG. 2. A controller contimi- sections a number of different problems are solved and 
ously senses one of 32 pins in a pin selector for high or low. 50 discussed. 

If the selected pin goes high or low, checked in a wait circuit Jn this sec tion a problem is solved with controllers and 
31, the controller 29 generates an interrupt 32 and/or event timers, showing the advantage of these blocks. The example 
33. The controller 29 also can start an i/o-processor timer 34. we ^ follow ^ a part D f me receiving procedure of 
A controller stops iteself when the programmed state is RS-232. Please remember that our design goal was 4 bidi- 
fulfilled. FIG. 7 shows the principle of a controller. 55 rec tional RS-232 channels at the same time. 

The behavior of the controllers is set in its bit controller Problem: When receiving RS-232 you wait for a start bit 
register 35. The difference modes are described in Table 7. 33 to comej af ter me sta rt bit you have to sample each data 

bit 39 in the middle of the bit. Finally a stop bit should be 

TABLE 7 detected 40. The last data bit (#7) could be used as a parity 

bit, but in this small example we do not show how to 
implement a parity check. 



Mode 


Behavior of timer, besides generating 
IRQ or event 


0 


None, ordinary timer 


1 


Samples a pin value and stores its value 




for later use 


2 


Sets the selected pin low 


3 


Sets the selected pin high 



Mode Behavior 



0 No effect (not used) Method: Load (do not start) a timer, the timers value shall 

1 Start timer (no IRQ) b e set to the number of i/o processor cycles equal to 1.5 

3 SES^o-™* IRQ bits. Tbc timer also shall be set to sample the 

4 Generate Event 65 RS-232, RxD signal. The sampled value of the RxD signal 

5 start timer + Generate Event is stored in a special 1 bit wide register, which can be read 

in the interrupt routine. 
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Then use an i/o-processor controller to sense for the start 
bit 38; the controller will start the timer and generate an 
interrupt. In the interrupt routine we setup a free register to 
store the byte being received. In the same interrupt routine 
we load the timer with the value equal to the length of one 5 
RS-232 bit. The timer now samples all data bits 39 exactly 
at the right time. In the timer interrupt routine we take care 
of the sampled bit 39 and increment a counter to decide if 8 
data bits 39 have been received. Because the timer self 
samples the data bit the time for the interrupt is not critical. 10 
When all 8 bits are received the timer should sample the stop 
bit 40 and then stop. In the Last interrupt routine the received 
byte can be put in the FIFO 11 to the CPU 1. This routine 
also reloads the timer value (with 1.5) and starts the con- 
troller to sense for another start bit, indicating another byte 15 
to be received. 

In this section a problem is solved with events. The 
example we will follow is a negotiation from Compatibility 
mode to Nibble mode in the IEEE-1284 protocol. 

Example: The difficulty when communicating in IEEE- 20 
1284 is the large number of pins (8) used when handshaking. 
This example strictly follows the timing diagram shown in 
FIG. 11. The number standing to the left of the pin name is 
the pin number. The letter H and P indicates if the signal is 
Host or Peripheral driven. In this example the AJAX is 25 
working as a Host. Finally the Tp is the minimum pulse for 
this protocol, this time is 500 ns. If we assume that the 
i/o-processor is clocked at 200 MHz this is equal to 100 
clock-cycles. Notice that the numbers at the bottom line 
indicate the different timing points. 30 

Method: Put 0000 0000 at the data bus (bus 0) (point 0), 
then load a timer to wait for Tp seconds. Then set nSelectin 
high and nAutoFd low (point 1); these signals are a request 
to the peripheral if it is an IEEE-1284 compatible unit. 

Start sensing the pins PError, nAck, nFault and Select for 35 
the response signature for IEEE-1284. At this point we also 
have to start a timer to count for a time-out if the peripheral 
does not respond. The maximum response time in IEEE- 
1284 is 35 ms (equals 7.000.000 i/o-processor cycles) but in 
this example we just set the timer to wait for 7000 cycles 40 
(equals 35 /is). 

If the peripheral responses correctly (point 2), we 30 pulse 
the nStrobe signal (point 3,4). We also stop the timer 
counting for time-out. 

Wait for nAck to go high (point 6), indicating that the 45 
peripheral has accepted the requested mode. 

The i/o-processor is mainly implemented in a 1 -phase 
clocking system, but area consuming register banks have 
been implemented as latches in a 2-phase system. The whole 
design is intended to be clocked at 200 MHz. The 1 -phase 50 
system was chosen since the clocking period is very short 
and the logic is often hard to divide into two equally delayed 
parts. Of course, time sharing between phases could have 
been used in a 2-phase system but would have required 
advanced clocking analyses. Our strategy facilitates the 55 
logic design work, but the timing restrictions of the registers 
are a bit more complex because write signals to the 2-phase 
latches have to be stable at the second phase. Another 
drawback is that the two phases are not loaded equally. This 
increases the risk of some clock-skew but modern technolo- 60 
gies has a well working clock tree architecture that often 
solves these problems. 

The instructions in the i/o processor architecture are 
chosen to be 24-bit words in order to hold general instruction 
coding plus general instruction coding. 65 

The instruction memory is preferably implemented as an 
on-chip RAM memory. The size may be up to 2048 words. 
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To support one of the RS-232, IEEE-1284 or ATA protocols 
the total size needed is less than 512 words. The timing of 
the RAM 3 has to fit into one cycle e.g. less than 5 ns. The 
RAM 3 can be loaded serially by the CPU 2 or by defining 
necessary pins in the architecture to be both RAM program- 
ming and general i/o pins. 

The i/o-processor core 12 is implemented as a two step 
pipeline, illustrated in FIG. 12. The first step is the IF 
(Instruction Fetch) step (or stage) 41, where the instructions 
are read in from the internal RAM 3. The second and last 
step is the EX (Execution) step (or stage) 42 where operands 
are fetched and executed in the ALU 18. All functions like 
pin access, FIFO access are also positioned in this second 
step 42. 

The instruction memory is located between two a-phase 
flip-flops 43, 44. The EX-step 42 is a bit special since the 
general registers are implemented as b -latches to save area. 
FIG. 13 illustrates the different phases in the EXstep 42. 

The execution is located between two b-phases 45,46, e.g. 
the register bank 19 and the flag registers, and divided by an 
a-phase 47 (latches). The registers are updated 1.5 cycles 
after the IF-step 41, which is allowed since the registers 19 
are not accessed until 0.5 cycle after the IF-step 41. The 
instructions pass a first and second instruction decoder 48,49 
in the EX-step 42 in order to control the different compo- 
nents of the EX-step 42. 

Branch decisions are taken in the second phase of the 
EX-step. This is one of the most time critical parts of the 
i/o-processor core 12. Hie PC handler logic 17 to decide 
whether a branch is to be taken or not and select the proper 
PC in less than 2.5 ns. 

The PC Handler 17 is divided in two different parts. A first 
PC Handler part is controlling the PC and branches and a 
second PC Handler part is the interrupt- and event-handler. 
The interrupt- and event-address registers are implemented 
asb-latches, again to save some precious area. The logic that 
handles the priority between interrupts and events is quite 
complex but hidden in BDS code. The IRP (Interrupt Return 
Pointer) is an 11-bit flip-flop and included in the PC Handler 
17. The selection of the PC is divided in two steps 50 and 51, 
illustrated in FIG. 14, to ensure that the timing demands will 
be fulfilled. 

The immediate- and the interrupt- addresses (IRQ) have to 
be selected by an own multiplexor 51. These are the most 
time critical signals. The PC++ signal is the incremented PC 
and is selected as default at normal instructions. The RTI 
instruction selects the IRP address at the first multiplexor 50 
and the normal at the second. The halt address is simply the 
same PC as in the last cycle. When an event occurs the select 
logic selects the event address. The design may be reset to 
any address, sent via the command register from the CPU. 

The register bank is implemented by b-latches to save 
area. Each 16-bit register is divided in two separate 8 -bit 
registers. This allows the instructions to write back the result 
to the lower 8-bit field without affecting the upper 8-bit field 
of a register. All external register sources are directly avail- 
able as registers according to Table 8. The right FIFO data 
and external bus data are selected by external 4x16 multi- 
plexors. 

TABLE 8 

Register # Source 

10 immediate 

11 FIFO data 

12 FIFO status 
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TABLE 8 -continued 



Register # 


Source 


13 


external bus data 


14 


command register 


15 


pin vector 



5 



The ALU is implemented as both BDS code and manually 
designed logic, Tlie result must be calculated in 2.5 ns, but 10 
this is not the major problem. The update of the Z-flag is the 
most critical path. It was not possible to build an incrementer 
or decrementer and afterwards check if the produced result 
should result in a Z-flag. The solution is to check if the 
operand to be incremented (or decremented) is equal to 15 
Oxffffffff (or 0x00000001 for the decrement operation). 
These values should result in a set Z-flag and the test is 
simply an 8 -bit AND gate plus inverters. This is a much 
quicker test than the increment plus compare operations. 

The final implementation of the i/6 processor 2 contains 2 o 
8 timers. The timers are built as separate blocks, shown in 
FIG. 6. The time'r blocks include the two registers in each 
timer, the registers are implemented as flip-flops. A timer is 
implemented between two a-phases. The timers event and 
interrupt signals are connected to the PC-Handler. If the 25 
timer is set to sample a bit the sampled result is stored in a 
flip-flop. This flip-flop could be read by the instruction RTB 
(Read Timer Buffer). The pin selected by the timer extended 
register is connected to the pin register in the register bank 
19, but also to the mask register 21. Table 9 shows the pins 30 
in the pin vector 22. 

There are 6 Pin Controllers 29 in the i/o-processor imple- 
mentation. Like the timers the controllers are built as sepa- 
rate blocks, between two a-phases. The controller includes 
the controller register, which is implemented with flip-flops. 35 
A description over a controller is illustrated in FIG. 7. The 
pin selected in the controller is included in the pin vector in 
the register bank, see Table 9. 



TABLE 9 



Pin register bit 


Source 


Pin register bit 


Source 


0 


Timer 0 


8 


Controller 0 


1 


Timer 1 


9 


Controller 1 


2 


Timer 2 


10 


Controller 2 


3 


Timer 3 


11 


Controller 3 


4 


Timer 4 


12 


Controller 4 


5 


TunerS 


13 


Controller 5 


6 


Timer 6 






7 


Timer 7 







45 



All pins and buses may be tristated individually. Tristate 
data and pin values are stored in special registers like in FIG. 
15. 

A zero in the tristate register makes the pin work as an 
output and a one as an input. It should be apparent that the 55 
present invention provides an improved inputoutput control 
system, supporting different protocols. Although the inven- 
tion has been described in conjunction with a specific 
embodiment thereof, this invention is susceptible of embodi- 
ments in different forms, with the understanding that the 60 
present disclosure is to be considered as an exemplification 
of the principles of the invention and is not intended to limit 
the invention to the specific embodiment illustrated. 

What is claimed is: 

1. An on-chip i/o-processor for controlling and commu- 65 
nication with peripheral devices, characterized by an i/o- 
processor core (12), comprising at least one pin controller 
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(29) for reading and setting physical i/o-pins (6), starting 
timers and generating interrupts for the i/o-processor core 
(12), at least one timer (26) for sampling i/o-pins (6), setting 
i/o-pins (6) and generating interrupts for the i/o-processor 
core (12) at well defined points of time, said i/o-processor 
core (12) providing instructions for controlling said at least 
one pin controller (29), said at least one timer (26) and 
i/o-pins (6), an on chip RAM (3) holding instructions for the 
i/o-processor core (12), at least one register (4) for exchang- 
ing information between the i/o-processor (2) and a con- 
nected CPU (1) on the same chip and inversely, configurable 
logic (5), connected between said core (12) and said i/o-pins 
(6), for synchronization of incoming and/or outgoing sig- 
nals. 

2. An on-chip i/o-processor according to claim 1, char- 
acterized in that said configurable logic further comprises 
pin selecting means (30) connected to pin level comparing 
means (31), for generating events (33), interrupts (32) or 
timer start signals (34) (FIG. 7). 

3. An on chip i/o processor according to claim 1, char- 
acterized by 32 i/o pins connected to said configurable logic, 
and two 8-bit buses and two 16-bit buses connected to the 
core and FIFOs. 

4. An on-chip i/o-processor according to claim 3 charac- 
terized by an i/o-processor architecture 35 enabling fast data 
transfer by allowing direct transfers between the buses (36, 
37) and the FIFOs (11) (FIG. 9). 

5. An on-chip i/o processor according to claim 1, wherein 
RS-232, IEEE- 1284, ATA, and SCSI protocols are sup- 
ported. 

6. An on-chip i/o-processor according to claim 1, char- 
acterized in that the i/o-processor has 8-bit pin mask 
registers, and these registers are used together with pin select 
registers, wherein the mask registers are used to select a 
combination of pins, by means of an AND -gate, which are 
compared, by means of a comparison component, with 8 -bit 
pin mask compare registers, enabling the i/o-processor to 
wait for a combination of several signals and then generate 
an event. 

7. An on-chip i/o-processor according to claim 1, char- 
acterized in that the i/o-processor core is a two step pipeline, 
wherein the first step is an IF step, wherein the instructions 
are read in from the internal RAM, and the second step is an 
EX step, wherein operands are fetched and executed in the 
ALU. 

8. A method for controlling peripheral devices using the 
i/o-processor according to claim 5, characterized in that a 
receiving procedure of RS-232 signals comprise the steps of: 

loading a timer with the number of i/o processor cycles, 

and a value in order to sample an RS-232, RxD signal; 
storing said sampled value of the RxD signal in a special 

1 bit wide register, readable in an interrupt routine; 
using an i/o-processor controller to sense for a start bit, 

wherein the controller will start the timer and generate 

an interrupt; 

setting up a free register to store the byte being received; 
loading the timer with the value equal to the length of one 
RS-232 bit; 

sampling all data bits at the right time by the timer; 
incrementing a counter to decide if the predetermined 

number of data bits has been received; 
when all the data bits are received sampling the stop bit 

and then stop; 
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in the last interrupt routine putting the received byte in the 

FIFO for the CPU; 
reloading the timer value; and 

starting the controller to sense for another start bit, 
indicating another byte to be received. 5 

9. A method for controlling peripheral devices using the 
i/o -processor according to claim 5, characterized in that a 
negotiation from Compatibility mode to Nibble mode in the 
IEEE- 1284 protocol comprises the steps of: 

putting 0000 0000 at the data bus; 10 

loading a timer to wait for Tp seconds; 

setting nSelectin high and nAutoFd low; 
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sensing the pins PError, nAck, aFault and Select for the 
response signature for IEEE-1284; 

starting a timer to count for a time-out if the peripheral 
does not respond; 

if the peripheral responds correctly, pulsing the nStrobe 
signal; 

stopping the timer counting for time-out; and 

waiting for nAck to go high, indicating that the peripheral 
has accepted the requested mode. 

***** 



07/31/2003, EAST Version: 1.04.0000 



